Journals
  Publication Years
  Keywords
Search within results Open Search
Please wait a minute...
For Selected: Toggle Thumbnails
Footnote Identification within a PDF Document
LI Sida, GAO Liangcai, TANG Zhi, YU Yinyan
Acta Scientiarum Naturalium Universitatis Pekinensis    2015, 51 (6): 1017-1021.   DOI: 10.13209/j.0479-8023.2015.087
Abstract1279)            Save

A robust method of identifying and linking footnote and its reference in the text is proposed to solve the footnote recognition problem. Novel features of the footnote, including page layout, font information, lexical and linguistic features, are utilized for the task. Clustering is adopted to handle the features which vary in different kinds of documents but stable within one document so that the process of identification is adaptive with document types. In addition, this method leverages results from the matching process to provide feedback to the identification process and further improves the algorithm accuracy. The primary experiments in real document sets show that the proposed method is promising to identify footnote in a PDF document.

Related Articles | Metrics | Comments0